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ABSTRACT We have used a forward genetic screen to identify genes required for transgene silencing in KEYWORDS 

the mouse. Previously these genes were found using candidate-based sequencing, a slow and labor- mouse 

intensive process. Recently, whole-exome deep sequencing has accelerated our ability to find the causative epigenetics 

point mutations, resulting in the discovery of novel and sometimes unexpected genes. Here we report the forward genetic 

identification of translation initiation factor 3, subunit H (e/F3h) in two modifier of murine metastable screen 

epialieies {Mommes) lines. Mice carrying mutations in this gene have not been reported previously, and elF3h 
a possible involvement of elF3h in transcription or epigenetic regulation has not been considered. 



Forward genetic screens have been used to identify genes involved in 
position effect variegation (PEV) in Drosophila (Eissenberg and Reuter 
2009; Henikoff 1990; Schotta et al. 2003). PEV describes the varie- 
gated (red and white patches) eye phenotype caused by stochastic 
silencing of the white locus, when placed near pericentric heterochro- 
matin (Muller 1930). Many of the genes involved in PEV turned out 
to have a critical role in gene silencing across the genome, and their 
functions are generally conserved between eukaryotes (Fodor et al. 
2010). We have carried out a sensitized iV-ethyl-iV- nitrosourea 
(ENU) mutagenesis screen to identify genes involved in the establish- 
ment and maintenance of epigenetic state in the mouse. Our screen 
uses a GFP transgene that is expressed in red blood cells in a variegated 
manner due to stochastic transcriptional silencing. GFP expression is 
assessed by flow cytometry using one drop of blood. Approximately 
55% of erythrocytes express the transgene in the wild- type transgenic 
line (Line3, an FVB/N J- derived mouse strain). The percentage of cells 
expressing GFP and the mean fluorescence of expressing cells is highly 
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reproducible among inbred Line3 individuals (Blewitt et al. 2005). 
Offspring of ENU-treated males are screened for a shift in GFP ex- 
pression, and the screen is designed to detect both suppressors and 
enhancers of variegation (Blewitt et al 2005). This is a screen for 
dominant effects, and mutants identified are called modifiers of 
murine metastable epialieies (Mommes). The screen has identified 
known (Dnmtl, Smarca5, Hdacl, Bazlb, Trim28) and novel (Smchdl) 
modifiers of epigenetic reprogramming, all of which play critical roles 
in normal embryonic, fetal, or early postnatal development (Ashe 
et al. 2008; Blewitt et al. 2008; Chong et al. 2007; Whitelaw et al. 
2010). Here we report the identification of the mutated gene in 
MommeD12 and MommeD38. 

MATERIAL AND METHODS 
Mouse strains and ENU screen 

Procedures were approved by the Animal Ethics Committee of the 
Queensland Institute of Medical Research. The ENU screen was 
carried out in an FVB/NJ inbred line that carry a GFP transgene, as 
described previously (Blewitt et al. 2005). Mutant lines were main- 
tained in this background, and all experiments were carried out using 
MommeD mice five generations or more removed from the MommeD 
founder. 

Whole-exome deep sequencing 

The exomes of the two mutant lines, MommeD12 and MommeD38 y 
were captured using the SureselectXT Mouse All Exon version 1 kit 
(Agilent) and the SeqCap EZ Mouse Exome, version Beta 2 
[110603_MM9_exome_rebal_2_EZ_HXl] kit (Roche NimbleGen), 
respectively. Both the DNA capture and library preparation was 
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Figure 1 GFP expression profiles and mean fluorescence 
of expressing cells in MommeD12 and MommeD38. (A) 
FACS profiles of MommeD12 and MommeD38 mutants. 
Erythrocytes from three-week-old mice were analyzed by 
flow cytometry with a GFP-positive gate set to exclude 
99% of wild-type erythrocytes. In each case, the expres- 
sion profiles from one litter are displayed. The phenotyp- 
ically wild-type mice are shown in black and heterozygotes 
in red. The x-axis represents the erythrocyte fluorescence 
on a logarithmic scale, and the y-axis is the number of cells 
detected at each fluorescence level. Mean fluorescence 
was calculated using cells within the positive gate. Histo- 
grams depict only the GFP fluorescence channel. (B) 
Quantitative analysis of MommeD12 and MommeD38 ex- 
pression and mean fluorescence of expressing cells. Each 
mutant line has a significantly different expression profile 
to that of wild-type littermates, reproducible over many 
generations. Data were collected from at least six litters 
in each case. Student t-test; *P < 0.0001. 



carried out as outlined in the protocols supplied by Agilent (protocol 
version 1.1.1) and Roche/nimblegen (NimbleGen SeqCap User's 
Guide, version 1.0, Illumina optimized), except that a Bioruptor (Dia- 
genode) was used for fragmentation of the DNA. The Bioruptor was 
optimized to produce ^200-300 bp fragments, which was achieved by 
running for 3 X 10 min on the low setting with cycles of 30 s on and 
30 s off. The resulting libraries were sequenced at a final concentration 
of 10 pM in a single lane each for 2 x 60 cycles on a GAIIx (Illumina) 
using the TruSeq PE Cluster Kit version 2 and TruSeq SBS kit version 
5 GA (Illumina). 

SNP calling 

For each of the two exomes, putative ENU mutations were identified 
by comparing variant calls from a mutant with variant calls from 
a control exome. The control exomes were prepared in parallel using 



the same capture kits and sequenced in the same deep-sequencing 
run. The variant calls were obtained by first mapping the deep- 
sequencing reads to the mouse reference genome (build 37, mm9) 
with the program BWA, version 0.6.1 (Li and Durbin 2009), using the 
default settings. The output was coordinate- sorted using SAMtools 
version 0.1.17 (Li et ol. 2009), and PCR duplicates were eliminated 
using the program Picard MarkDuplicates version 1.48 (http://picard. 
sourceforge.net). The program SAMtools mpileup (Li et ol. 2009), 
with the settings -q 20 -r chrl5:40000000-68000000, was then used 
to generate pileup files from the linked region for the mutant and 
control exomes. Putative ENU mutations were finally identified with 
the program Varscan somatic version 2.2.8 (Koboldt et al 2012) using 
the settings -min-coverage 15 and -min-var-freq 0.3. As input, the 
pileup files from a mutant and a wild-type were used, and mutations 
were identified as substitution or indel variants that differed between 
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Figure 2 MommeD12 and MommeD38 have mutations in elF3h. (A) Schematic of the elF3h protein structure and positions of point mutations. 
The point mutation in MommeD12 causes skipping of exon 5. The MommeD38 mutation introduces a premature stop codon at amino acid 291 , 
a highly conserved region of the protein. (B) Quantitative real-time RT-PCR analysis for elF3h normalized to Hprt. The graph shows the mean ± 
SEM for four testes samples of each genotype. All reactions were performed in triplicate. Student t-test; *P < 0.05. 
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the two. Only a single heterozygous nucleotide substitution, subse- 
quently validated using Sanger sequencing, was found for each of 
the mutant lines. These point mutations were validated in over 50 
mutants and 50 wild- types in both cases. 

RNA isolation, cDNA analysis, and quantitative 
real-time RT-PCR 

Total RNA was extracted from various tissues using TRI reagent 
(Invitrogen). cDNA was synthesized from total RNA using Super- 
Scrip till reverse transcriptase (Invitrogen). Quantitative real-time 
PCR was performed with the Platinum SYBR Green qPCR Super- 
Mix -UDG (Invitrogen) with primers designed to span exon/intron 
boundaries (available on request). All reactions were performed in 
triplicate and normalized to HPRT and GAPDH. PCRs were run on 
a Viia7 (Applied Biosystems). 

RESULTS 

A shift in the percentage of cells expressing GFP was seen in 
MommeD12 and MommeD38 mutants. The percentage of cells was 
significantly lower than it was in wild-type mice, and the mutants 
were classified as enhancers of variegation (Figure 1). The level of 
mean fluorescence in the expressing cells did not differ between 
mutants and wild- types (Figure IB). To identify the mutated genes 
in MommeD12 and MommeD38, we generated a G2 mapping pop- 
ulation by back-crossing MommeD12 or MommeD38 mice to Line3C y 
a C57BL/6J congenic mouse strain homozygous for the GFP trans- 
gene. Mapping by traditional microsatellite and SNP analysis identi- 
fied a 3.3 Mb interval on chromosome 15 for MommeD12 (Figure SI, 
A and C). Exome deep sequencing and bioinformatic interrogation of 
the linked interval, identified a T-to-A transversion in the conserved 
polypyrimidine tract 10 bp upstream of the 3' splice site of intron 4 of 
eIF3h y a gene not previously considered to play a role in gene silencing 
(Figure 2A). Sequencing of the cDNA revealed that the mutation leads 
to skipping of exon 5 (Figure 2A), which results in an in-frame de- 
letion of 50 highly conserved amino acids in the MPN (Mprl-Padl- 
N-terminal) domain. 

An Illumina Golden Gate SNP genotyping assay was used to 
identify the linked chromosome for MommeD38. MommeD38 map- 
ped to an overlapping interval of 22 Mb (Figure SI, B and C). Sub- 
sequent exome deep sequencing and bioinformatic analysis of the 
interval identified a G-to-A transition in exon 7 of eIF3h. This muta- 
tion changes an arginine to a stop codon (Figure 2A). 

For both MommeD12 and MommeD38 y putative ENU-induced 
substitution or indel variants within the linked intervals were identi- 
fied by comparing exome variant calls generated from the two mutant 
lines with control exomes. The controls had been prepared in parallel 
using the same exome capture kits and on the same flow cell 
to minimize sample-to-sample variation. No other mutations were 
found in the intervals. 

Both mutations lead to reduced eIF3h mRNA (Figure 2B), suggest- 
ing that MommeD12 and MommeD38 are null alleles. These are the 
first mutations reported in this gene in the mouse, and we designated 
these alleles e IF3h MommeD12 and e iF3h MommeD38 . 

Heterozygotes for the e IF3h MommeD12 and e IF3h MommeD38 muta- 
tions were viable and fertile. However, for both e JF3h MommeD12 and 
e JF3h MommeD38 y heterozygous intercrosses produced no homozygous 
offspring at weaning, and timed matings revealed empty deciduas at 
E9.5 in ratios approximating that expected for the homozygous em- 
bryos (Figure 3A). Genotyping of the grossly normal embryos at E9.5 
revealed that these were either wild-types or heterozygotes. Inter- 
crosses between e IF3h MommeD12 and e IF3h MommeD38 heterozygotes 
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Figure 3 Embryo dissections and heterozygous intercrosses for 
MommeD12 and MommeD38. (A) Embryonic dissections of elF3h mu- 
tant mice. Embryonic dissections revealed no viable homozygotes at 
E9.5. All embryos were produced by natural matings, and detection of 
a vaginal plug was counted as embryonic day E0.5. The proportions of 
genotypes were compared with expected Mendelian ratios using a x 2 
test. (B) Heterozygous intercrosses of elF3h mutant mice analyzed at 
weaning. Tabulated data shows the number of observed mice and 
(percentage of total). 

produced no compound heterozygotes at weaning, as expected (Figure 
3B). Our results indicate that eIF3h is required for normal embryonic 
development in the mouse. This is consistent with the finding that the 
eIF3h homolog in zebrafish is required for early embryonic develop- 
ment (Choudhuri et al 2010). 

DISCUSSION 

eIF3h, together with two other nonconserved subunits (eIF3e and 
eIF3f) and three conserved subunits (eIF3a, eIF3b, and eIF3c), are 
thought to form the functional core of the mammalian eukaryotic 
translation initiation factor eIF3 (Masutani et al 2007). Although it is 
possible that eIF3h has been identified because of an effect on the 
translation of the GFP mRNA, rather than transcription of the trans- 
gene, we consider it unlikely. We do not see a shift in the GFP mean 
fluorescence in the expressing cells in the e JF3h MommeD12 and 
e JF3h MommeD38 mutants, suggesting no major defect in GFP protein 
turnover (Figure IB). 

Some eIF3 subunits have been shown to interact with the 26S 
proteasome (Dunand-Sauthier et al 2002; Hoareau Alves et al 
2002; Paz-Aviram et al 2008; Yen and Chang 2000), suggesting 
that eIF3 has functions in biological processes other than trans- 
lation initiation. Intriguingly, eIF3e, another nonconserved sub- 
unit of eIF3, has been shown to concentrate in the nucleus in 
mammalian cells in a cell cycle-dependent manner (Watkins 
and Norbury 2004), consistent with a role in the nucleus. This 
supports the hypothesis that eIF3h affects transgene expression 
via a role at the level of transcription. 

In summary, we have identified the first two alleles of eIF3h in 
an unbiased forward genetic screen for modifiers of epigenetic 
reprogramming in the mouse. We anticipate that these mouse 
mutants will provide useful tools for others to perform further 
studies on eIF3h function in the cell. Interestingly, human eIF3h 
has recently been considered as a candidate gene for Microcephaly- 
Thin Corpus Callosum syndrome, a rare recessive disorder (Halevy 
et al 2012). 
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